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Abstract. We generalize a theorem of Nymann that the density of 
[ • points in Z'* that are visible from the origin is l/("(d), where ("(a) is the 

Riemann zeta function Xii^i 1/^"- A. subset S C Z'' is called primitive 
if it is a Z-basis for the lattice Z"* fl span[((5'), or, equivalently, if S can 
' be completed to a Z-basis of Z"*. We prove that if m points in Z"^ are 

, chosen uniformly and independently at random from a large box, then 

. ' as the size of the box goes to infinity, the probability that the points 

' form a primitive set approaches l/[({d)C,{d — 1) • • • ({d — m + 1)]. 



1. Introduction 



I A classic result in number theory is that, if a point in I? is chosen "at 

■ random," the probability that the point is visible from the origin (that is, 

O . not hidden by another point in Z^) is where is the Riemann zeta 

^ . function YliZi ^ (see P for a proof using Euler's totient function). More 

precisely, for a given n, if we choose an integer point (a, b) uniformly at 
random from the box [— n, n] x [—n, n] and compute the probability that 
(a, b) is visible from the origin, then as n approaches infinity, this probability 



approaches ^^j. 



s 

■ J.E. Nymann generalized this result to higher dimensions |7j: if a point in 

^ . Tj'^ is chosen at random, then the probability that the point is visible from 

the origin is This theorem is true for d > 2 and is, in effect, true for 

d = 1: the only points in that are visible from the origin are ±1, so the 
probability is 0, and diverges so that = 0. 

An obvious way to restate the condition that a point s = (ai, 02, . . . , a^) G 
Z"^ is visible from the origin is that gcd(ai, . . . , ad) = 1- We will restate the 
condition in a lattice theoretic context, so that it may be generalized to 
picking more than one point in Z"^. A point s is visible from the origin if 
and only if {s} is a Z-basis for the lattice spanjj(s) nZ'^. In general, given a 
set 5 = {si, S2, • • • , s-m} C Z'^, where 1 < m < d, we say that S is primitive 
if 5 is a Z-basis for the lattice span]g(S') Pi Z*^. An equivalent definition |H] 
is that S is primitive if and only if S can be completed to a Z-basis of all 
of Z*^. 
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In this paper we prove that if S is chosen "at random," then the proba- 
bihty that S is primitive is 

1 

C{d)C{d-l)---C{d-m + l)' 
To be precise, we prove the following theorem. 

Theorem 1. Let d and m be given, with m < d. For n S Z+, 1 < k < m, 
and 1 < i < d, let bn^k,i £ ^- For a given n, choose integers Ski uniformly 
(and independently) at random from the set bn,k,i < •Sfci < ^n,fc,i + it-- Let 
Sk = (sfci, . . . , Skd) and let S = {si, S2, ■ ■ ■ , Sm}- 

If\bn.k,i\ is bounded by a polynomial in n, then, as n approaches infinity, 
the probability that S is a primitive set approaches 

1 

C{d)C{d-l)---C{d-m + l)' 

where C{a) is the Riemann zeta function "^^i^i ^■ 

When m = 1, this theorem gives the classic result (d = 2) and Nymann's 
result. Note also that, if m = d and we choose S of size m, then the 
probability that S is primitive (i.e., that it is a basis for Z*^) approaches 
zero. This agrees with the theorem in the sense that we would expect the 
probability to be 

1 

c(d)c{d-i)---aiy 

but C(l) does not converge. 

The statement of the theorem uses more general boxes than [—n,n]'^ to 
pick the Sk from. We do this because the more general result is needed in jl]. 
That paper was the original inspiration for this theorem: we discovered it in 
an attempt to prove a fact in computational biology and Bayesian network 
theory. Since the concept of primitive sets is important in the geometry 
of numbers, we are proving this theorem in this separate paper. Note that 
some bound on the bn,k,i in terms of n is needed; otherwise one could use 
the Chinese Remainder Theorem to construct arbitrarily large boxes from 
which no primitive sets could be selected (even for d = 2, m = 1). 

In Section 2, we present an outline of the proof. The outline is a full 
proof in every respect, except that we ignore the error estimations in our 
probabilities. In that sense, it is the "moral" proof of the result. In Section 
3, we fill the holes by proving that the error estimates approach zero as n 
approaches infinity. The methods in Section 3 are themselves of interest, 
using concepts from triangulations of point sets, the metric geometry of 
polytopes (cross-sections of d-cubes), analytic number theory (consequences 
of the Prime Number Theorem), and the geometry of numbers. 

2. Outline of the proof 
We proceed by induction on m. 
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If m = 0, the theorem is trivially true. Assume that the theorem is true for 
m — 1, and we will prove it for m. The probability that S = {si, S2, ■ ■ ■ , Sm} 
is primitive is the product 

Probp„ ({si, . . . , Sm-i} is primitive) 

• Probp^ (5 is primitive, given that {si, . . . , Sm-i} is primitive), 

where Vn is the probability distribution, for a given n, from which we are 
choosing S. The first term in the product approaches 

1 

C{d)C{d-l)---C{d-m + 2)' 
as n ^ oo, by the inductive hypothesis, so we must show that the second 
term approaches ^^r^^qrjy- 

Indeed, suppose {si, . . . , Sm-i} is given and is primitive, and we choose 
Sm = {smi, ■ ■ ■ , Smd) (independently from the other Sj) according to the 
probability distribution Vn- Let A be the (m — 1) x d integer matrix whose 
rows are si, . . . , Sm-i- We will need the following lemma, to find a simpler 
matrix whose rows also form a primitive set. 

Lemma 2. Let A be a matrix in Z^^"?, and let U be a unimodular matrix 
(i.e., det{U) = ±1) in Z"?^^. The rows of A form a primitive set if and only 
if the rows of AU also form a primitive set. 

Proof. Suppose the rows of A form a primitive set. Let a G Z'' be in the 
M-span of the rows of AU, that is, a = xAU, where x is a matrix in M"^^^. 
In order to show that the rows of AU form a primitive set, we must show 
that X is actually integral. Indeed, aU~^ = xA G Z'' is in the R-span of the 
rows of A, and since the rows of A form a primitive set, x must integral. 
This also proves the converse, as U~^ is unimodular and A = {AU)U~^. □ 

The matrix U we will choose is a matrix that puts AU into Hermite 
normal form. 

Definition 3. A matrix B £ IF^'^ is in Hermite normal form if 

(1) Bij = for allj > i, 

(2) Bii > for all i, and 

(3) < Bij < Bii for all j <i. 

Given any integer matrix B of full row rank, there exists a unimodular 
matrix U such that BU is in Hermite normal form (see, e.g., 0; U will not, 
in general, be unique). This fact, together with the following lemma, gives 
a convenient characterization of when S* is a primitive set. 

Lemma 4. Let {si, . . . , s-m-i} C Z'^ 6e a primitive set, and let Sm S Z"^ be 
given. Let A be the (full row rank) matrix with rows si, . . . , Sm-i, and let U 
be a matrix such that AU is in Hermite normal form. Let U^'^^ be the i-th 
column of U . Then {si, . . . , Sm} is a primitive set if and only if the SmU^^\ 
for m < i < d, are relatively prime. 
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Proof. By Lemma |21 the rows of AU form a primitive set. It follows that 
{AU)ii = 1, for 1 < i < m — 1 (otherwise Ci, the i-th standard basis vector, 
would be in the M-span of the rows of AU, but not in the Z-span). Then, 
from the definition of Hermite normal form, (AU)ij = for i ^ j. Let 
A' be the matrix with rows si, . . . ,Sm (that is. A' is A with the additional 
row Sm appended). By Lemma |2l {si,...,Sm} is a primitive set if and 
only if the rows of A'U form a primitive set. We see that this is true 
if and only if the {A'U)mi, for m < i < d, are relatively prime (indeed, 
the index of the lattice span^jsi, . . . , Sm} within n span]g{si, . . . , Sm} 
is gcd{{A'U)mi '■ m < i < d}). Since {A'U)mi = SmU^^\ the lemma 
follows. □ 



Let // : Z-|_ {—1,0, 1} be the Mobius function defined to be 
^^[D)-- 



(— 1)* if L) is the product of i distinct primes, 
if D is divisible by the square of a prime. 



Given D G let pnD be the probability that D divides SmU^'^'^ for all 
m < i < d. Note that pnD is independent of our choice of U, because, 
as we noted in the proof of Lemma El gcd{smU^^^ : m < i < d} is the 
index of the lattice span^jsi, . . . , Sm} within Z'^ n spauj^jsi, . . . , Sm}, which 
is independent of U. Then, using inclusion-exclusion, the probability that 
the SmU^^\ for m < i < d, are relatively prime is 



nD- 

D=l 

We expect each pnD to be approximately in Section 3, we 

will show that 



(1) lim V ii{D)pnD = V /.(D)D-('^— +1) 

n— >oo ■'— ' ^ — ' 



D=l D=l 



Given that we have verified the following lemma (applied to a 
d — m + 1) finishes the proof of the theorem. 

Lemma 5. For any integer a > 2, 

oo -. 
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Proof. Since a > 2, the sum is absolutely convergent, and we have that 

oo 

D=l p prime 

1 



n i_p- 

p prime 



p prime 
1 



oo 

i=l 
1 



C(a) 

□ 

3. Error Estimates 

The remaining piece of the proof is to demonstrate Equation that is, 
that 



J2 KD)PnD - X; KD)D^ 







D=l D=l 

as n — > oo. 

We will need a bound on the entries of U, which the following lemma will 
help us get. 

Lemma 6. Given a rank p matrix A G W^'^ and a bound Mq such that 
\Aij\ < Mq for all i,j, there exists a unimodular matrix U such that 

(1) AU is in Hermite normal form and 

(2) \Uij\ <p\qMl for allij. 

Proof. Let B be the q x q matrix obtained be appending to A the rows 
ei, 62, . . . , eg_p (where is the i-th standard basis vector). Without loss 
of generality, we can assume that i? is a nonsingular matrix (otherwise, we 
could have appended different Cj). Let U he a unimodular matrix such that 
BU is in Hermite normal form. Note that AU is also in Hermite normal 
form. 

We will use the fact that 
(2) U = B-\BU) = ad]{B){BU), 
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where adj(i?) is the adjugate (classical adjoint) of B, in order to bound the 
entries of U. Since BU is lower diagonal, 

Q 

\det{B)\ = det{BU) = lliBU)ii. 

i=l 

Therefore {BU)ii < |det(i?)| for all i, and, by the definition of Hermite 
normal form, we conclude that {BU)ij < |det(i?)| for all 

Since the first p rows of B have entries bounded by Mq and the remaining 
rows are standard basis vectors, the entries of adj(-B) are bounded by plM^. 
Combining these two bounds, we see that the entries of adj f7) are 

bounded by q ■ p\M^ ■ |det(B)|. Using © we conclude that 

for all as desired. □ 

Since the absolute value of the entries of A are bounded by the bn,k,i + n, 
which we assume to be bounded by a polynomial in n, Lemma IHl shows 
that the unimodular matrix U can be chosen such that the absolute value 
of each entry of U is bounded by a polynomial in n. This in turn implies 
that |srrt?7^*^| is also bounded by a polynomial in n (where [7^*^ is the i-th. 
column of U). Let M = M(n) be our bound on \sm ; say M is 0{n ) 
for some k. Clearly, for D > M, pno = 0. 

We have that 



D=l 
< 



(3) 



D=l 



D=l 



+ 



M 



nD 

D=n+l 



+ 



nD 



D=M+1 



+ 



(d-m+l) 



n 

- E 1^"^ 

D=l 



D=n+1 
M 



oo 

E 

D=n+1 D=n-irl 



Of the three nonzero terms in the last expression, ^"^^^^i 
certainly converges to zero as n approaches infinity, so it suffices to show 
that the first two terms, X]d=i \PnD — | and X]^=„+iPn_D, do as 

well. We break our error computation into these two cases. 

Before we handle the two error sums in Lemmas [7| and |H1 we set some 
common terminology. Let Bn be the d-dimensional box of integers {sm S 
: bn,m,i < Smi < bn,m,i + n, for all i}, which is the box from which Sm is 
chosen with uniform probability. Given D G Z+, let A/) C be the lattice 
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of integer vectors x G Z'^ such that D divides x ■ U^'^\ ioi m < i < d. A^j is 
a sublattice of IJ^ of index D'i-^^+i. Let S„d = BnnAD. Then 



(4) PnD 

Lemma 7. As defined above, 



\SnD\ 



D=l 

converges to zero as n ^ oo. 

Proof. Suppose 1 < D < n. Let Ld C Z'' be the lattice of integer vectors 

(xi, . . . ,Xd) G such that D divides each Xj. Ld is a sublattice of Z'^ of 
index D'^. In fact, we see that L^j is a sublattice of Ajj, and therefore its 
index in Ad is D<^/D<^-'^+^ = D""'^. 

This means that if we look at any D x ■ ■ ■ x D cube, C = {(xi, . . . , x^^) G 
Z'^ : < Xj < Tj + D} for some rj G Z (that is, a translate of a fundamental 
parallelepiped of Ljj), then C contains exactly D"'- '^ elements of A^). Since 
Bn can be covered by such boxes, we have that I^dI < -D"*~^(;g+1)'^, 

and so 



Pnd < 



n" 



k=0 



n 



d-k 



Similarly, (-g — l)'' disjoint D x ■ ■ ■ x D cubes can be placed inside H„, and 

so 



Pnd > 



D 



m—l( n 



D 



1) 



k=o 



E M) 



\d-k 



D 



m—l—k 



n' 



d-k 



Combining these two inequalities and moving the k = d summand to the 
left-hand side, we see that 



Pnd 



]jd—m+l 



d-l 
k=0 



d\ D'"-!-^ 



n' 



d-k 



and so 



E 

D=l 



IPnD 



jj-{d-m+l) \ ^ ^ 



k=0 



n 



-{d-k) 



D=l 



which converges to zero as n ^ oo, proving the lemma. 
Lemma 8. As defined above, 

M 
D=n+1 

converges to zero as n — ^ oo . 



□ 
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Proof. Let 

A/ 

Tn = \^ SnD- 

D=n+1 

Let Nn be the maximum, over all Sm ^ Bn, of 

: n< D < M and G Sno}- 

Then 

M M 

PnD = n~''' ^ |5„d| 
D=n+1 Z)=n+1 

We need to approximate A^^^ and |T„| . We will repeatedly use the following 
fact (see PP, p:294), which can be derived from the Prime Number Theorem: 
for any e > and for any r < M, the number of factors of r is 0{rf ) (more 
precisely, for any (5 > and sufficiently large r, the number of factors of r is 
less than r'^^"'"'')'°s^/'°s^°^''; now we use that r < M is 0{n^) for some k). 

Claim 1: Nn is 0{rf). 

This follows immediately, as any element of the set 
{D : n< D < M and Sm G Sno} 
must be a factor of, say, SmU^"^\ and this number has 0{'nf) factors. 



Claim 2: |T„| is 0(n<^-2+^). 

Let a = gcd([/f ^ : m<i<d), where C/('"+^) , . . . , U^'^'i are the last 

d — m + 1 columns of U . Let R be the set of integers greater than n that are 
factors of at least one of a, 2a, 3a, . . . , [y^Ja. Each of the [\/raJ numbers 
i ■ a such that 1 <i < [\/raJ has O(n^) factors, so \R\ is 0{n'^^^). 

We divide T„ into two parts. Let 



tnl — 1^ SnD 



and let T„2 = Tn\Tni- We will show that both |T„i| and |T„2| are 0(n 
and so it will follow that \Tn\ = \Tni\ + \Tn2\ is also 0{n'^^^^''). 
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Claim 2a: |r„i| is 0{n'^~2+^). 
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Given a D G R, we want to estimate how large SnD is. Suppose first that 
conv(5nD) is a full dimensional polytope in Z*^, that is, its affine hull is all of 
M'^. Triangulate conv(S'n_D) into at least \SnD\ — d simplices whose vertices 
are in SnD (this can always be done, see for example |3|). Each simplex in the 
triangulation has volume at least because the lattice A„ (which 

includes every point in Sno) has index in Z"^. But conv(5nD) has 

volume at most n*^, because it lies in Bn- Putting this together, 



and so 



'tn—l 



\SnD\ <d + d\—, — -- <d + d\n" 

which is 0{n"^~^). 

On the other hand, if conv{SnD) is not full dimensional, then let k < d— 1 
be its dimension, and let H be the fc-dimensional affine space such that 
SnD C H. The fc-dimensional Euclidean volume H D Bn is at most 
\/2 n^, as proved in [2]. Again we can triangulate SnD into at least 
\SnD\ — k simplices that are /c-dimensional. The best we can know this time 
is that each simplex has volume at least ^. Putting this together, 

li\SnD\-k)<V2'-''n'', 

and so \SnD\ is 0{n^). 

In either case, \SnD\ is 0{n'^^^), and since \R\ is 0(n2+'^), |r„i| is 0{n'^~^- 



Claim 2b: jT^al is 0{n'^-^+^). 

Recall that a = gcd(C/|*^ : m < i < d). Without loss of generality, we 

may assume that U^'^ = a and U^'^ = 0, for m + 1 < i < d (if not, we may 
perform elementary column operations on the last d — m+ \ columns of U 
in order to put them in that form; the matrix AU will remain in Hermite 
normal form, because the last d — m + l columns of AU are all zeros). Note 
that a < M. 

Now suppose Sm2, Sni3, ■ ■ ■ , S-md are given, such that bn,m,i < Smi < bn,m,i + 

n. Given j such that 6n,m,i 1^ 3 < &n,m,i + define 

^^■'^ — (j) Sni2i SmZi . . . ) Smd)- 

We will show that 0(112+*^) of the t^^^ are in Tn2 (for given Sm2, ■ ■ ■ , Smd)- 

Since = 0, s' := is independent of j. If t^^^ G SnD 

for a particular D, then D must be a factor of s', which has 0(n'^) factors. 
Therefore there are only 0(n'^) possible D for which any of the t^^^ could be 
a member of SnD- 
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Now let us consider, for a given D ^ R, how many of the t(^) could be 
in SnD- If i^^'^ and t^'^) are in SnD, then D divides tO)C/M and t(^)C/M. 
Therefore D divides the difference t^-?) — which is {j — k) ■ a, 

since C/j'"'' = a. Since D ^ R, D does not divide a, 2a, . . . , [-^/njo, and so 
\j — k\ > ^Jn. Therefore the number of j such that t^^^ S SnD is at most 
nj \/n = ^/n. 

Since there are 0{rf) possibilities for D, and since, for a given D ^ R, 
the number of t^-'^ in SnD is 0(n2 ), we conclude that 0{n2~^'') of the t^-'^ are 
in Tn2- 

Since there are n'^~^ choices for Sm2, • • • j Smd, we have that \Tn2\ is 



Combining our estimates of Nn and |T„|, we have that 

M 

J2 PnD<n-'^\Tn\-Nn 
D=n+1 

and therefore "^^j^n+i P^D converges to zero as n approaches infinity. □ 
Combining Lemmas |7| and |S1 with Equation we have shown that 



J2 KD)PnD 



{d-m+1) 
D=l D=l 







as n ^ oo. This completes our error analysis and, together with Section 2, 
provides a complete proof of Theorem ^ 
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